Protein Structure Abstractionand Automatic Clustering Using Secondary Structure Element Sequences

نویسندگان

  • Sung-Hee Park
  • Chan Yong Park
  • Dae-Hee Kim
  • Seon Hee Park
  • Jeong Seop Sim
چکیده

To study protein clustering is very important in diverse fields such as drug design and environmental industry. For a meaningful clustering, protein structure must be considered. But, protein structures are very complicated and have so much information such as angles, 3-dimensional coordinates. Thus, it is not easy to efficiently compute their relations. In this paper, we present a method to efficiently abstract and cluster protein structures using secondary structure element sequences. Since a secondary structure element sequence is an abstract representation of protein structure, it can be regarded as a useful de-representation of protein structure, it can be regarded as a useful descriptor to cluster a set of proteins at the abstraction level. Using secondary structure element sequences and their distances, we implemented an automatic protein clustering system and verify their efficiency by experimental results.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Protein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches

DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...

متن کامل

Relation Between RNA Sequences, Structures, and Shapes via Variation Networks

Background: RNA plays key role in many aspects of biological processes and its tertiary structure is critical for its biological function. RNA secondary structure represents various significant portions of RNA tertiary structure. Since the biological function of RNA is concluded indirectly from its primary structure, it would be important to analyze the relations between the RNA sequences and t...

متن کامل

FTIR Investigation of Secondary Structure of Reteplase Inclusion Bodies Produced in Escherichia coli in Terms of Urea Concentration

Recent studies suggest that reducing the induction temperature would improve the quality of some recombinant inclusion bodies (IB) by providing a native-like secondary structure and leading to an improvement in protein recovery. This study focused on optimizing the solubilization condition of Reteplase, a recombinant protein with 9 disulfide bonds. The influence of lowering induction temperatur...

متن کامل

FTIR Investigation of Secondary Structure of Reteplase Inclusion Bodies Produced in Escherichia coli in Terms of Urea Concentration

Recent studies suggest that reducing the induction temperature would improve the quality of some recombinant inclusion bodies (IB) by providing a native-like secondary structure and leading to an improvement in protein recovery. This study focused on optimizing the solubilization condition of Reteplase, a recombinant protein with 9 disulfide bonds. The influence of lowering induction temperatur...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005